Search CORE

122 research outputs found

How Does Imperfect Automatic Indexing Affect Semantic Search Performance?

Author: Gotz David
Guo Mengtian
Wang Yue
Publication venue
Publication date: 08/04/2023
Field of study

Documents in the health domain are often annotated with semantic concepts (i.e., terms) from controlled vocabularies. As the volume of these documents gets large, the annotation work is increasingly done by algorithms. Compared to humans, automatic indexing algorithms are imperfect and may assign wrong terms to documents, which affect subsequent search tasks where queries contain these terms. In this work, we aim to understand the performance impact of using imperfectly assigned terms in Boolean semantic searches. We used MeSH terms and biomedical literature search as a case study. We implemented multiple automatic indexing algorithms on real-world Boolean queries that consist of MeSH terms, and found that (1) probabilistic logic can handle inaccurately assigned terms better than traditional Boolean logic, (2) query-level performance is mostly limited by lowest-performing terms in a query, and (3) mixing a small amount of human indexing with automatic indexing can regain excellent query-level performance. These findings provide important implications for future work on automatic indexing.Comment: 9 pages, 4 figures, HealthNLP 202

arXiv.org e-Print Archive

Normalization of Process Safety Metrics

Author: Wang Mengtian
Publication venue
Publication date
Field of study

This study is aimed at exploring new process safety metrics for measuring the process safety performance in processing industries. Following a series of catastrophic incidents such as the Bhopal chemical tragedy (1984) and Phillips 66 explosion (1989), process safety became a more important subject than ever. These incidents triggered the development and promulgation of the Process Safety Management (PSM) standard in 1992. While PSM enables management to optimize their process safety programs and organizational risks, there is an emerging need to evaluate the process safety implementation across an organization through measurements. Thus, the process safety metric is applied as a powerful tool that measures safety activities, status, and performance within PSM. In this study, process safety lagging metrics were introduced to describe the contribution of process related parameters in determining the safety performance of an organization. Lagging metrics take process safety incidents as the numerator and divide it by different process-related denominators. Currently a process lagging metric (uses work hours as denominator) introduced by the Center for Chemical Process Safety (CCPS) has been used to evaluate the safety performance in processing industries. However, this lagging metric doesn't include enough process safety information. Therefore, modified denominators are proposed in this study and compared with the existing time-based denominator to validate the effectiveness and applicability of the new metrics. Each proposed metric was validated using available industry data. Statistical unitization method has converted incident rates of different ranges for the convenience of comparison. Trend line analysis was the key indication for determining the appropriateness of new metrics. Results showed that some proposed process-related metrics have the potential as alternatives, along with the time-based metric, to evaluate process safety performance within organizations

Texas A&M Repository

Intuitionistic linguistic multi-attribute decision making algorithm based on integrated distance measure

Author: Jun Liu
Meiling Liu
Mengtian Wang
Pan Xu
Shouzhen Zeng
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2019
Field of study

This study aims to integrate the intuitionistic linguistic multi-attribute decision making (MADM) method which builds upon an integrated distance measure into supplier evaluation and selection problems. More specifically, an intuitionistic linguistic integrated distance measure based on ordered weighted averaging operator (OWA) and weighted average approach is presented and applied. The desirable characteristics and families of the developed distance operator are further explored. In addition, based on the proposed distance measure, a supplier selection problem for an automobile factory is used to test the practicality of its framework. The effectiveness and applicability of the presented framework for supplier selection are examined by carrying comparative analysis against the existing techniques of aggregation

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Clustering Analysis of User Loyalty Based on K-means

Author: Fang Qiushui
Leng Mengtian
Li Zhiming
Wang Zhen
Wu Jincheng
Publication venue: 'Bilingual Publishing Co.'
Publication date: 29/10/2020
Field of study

In recent years, the rise of machine learning has made it possible to further explore large data in various fields. In order to explore the attributes of loyalty of public transport travelers and divide these people into different clustering clusters, this paper uses K-means clustering algorithm (K-means) to cluster the holding time, recharge amount and swiping frequency of bus travelers. Then we use Kernel Density Estimation Algorithms (KDE) to analyze the density distribution of the data of holding time, recharge amount and swipe frequency, and display the results of the two algorithms in the way of data visualization. Finally, according to the results of data visualization, the loyalty of users is classified, which provides theoretical and data support for public transport companies to determine the development potential of users

Bilingual Publishing Co. (BPC): E-Journals

The application and sustainable development of coral in traditional medicine and its chemical composition, pharmacology, toxicology, and clinical research

Author: Mengtian Han
Mengtian Han
Yinglian Song
Yinglian Song
Yiye Li
Yiye Li
Zhang Wang
Zhang Wang
Zhongyuan Wang
Zhongyuan Wang
Publication venue: Frontiers Media S.A.
Publication date: 01/01/2024
Field of study

This review discusses the variety, chemical composition, pharmacological effects, toxicology, and clinical research of corals used in traditional medicine in the past two decades. At present, several types of medicinal coral resources are identified, which are used in 56 formulas such as traditional Chinese medicine, Tibetan medicine, Mongolian medicine, and Uyghur medicine. A total of 34 families and 99 genera of corals are involved in medical research, with the Alcyoniidae family and Sarcophyton genus being the main research objects. Based on the structural types of compounds and the families and genera of corals, this review summarizes the compounds primarily reported during the period, including terpenoids, steroids, nitrogen-containing compounds, and other terpenoids dominated by sesquiterpene and diterpenes. The biological activities of coral include cytotoxicity (antitumor and anticancer), anti-inflammatory, analgesic, antibacterial, antiviral, immunosuppressive, antioxidant, and neurological properties, and a detailed summary of the mechanisms underlying these activities or related targets is provided. Coral toxicity mostly occurs in the marine ornamental soft coral Zoanthidae family, with palytoxin as the main toxic compound. In addition, nonpeptide neurotoxins are extracted from aquatic corals. The compatibility of coral-related preparations did not show significant acute toxicity, but if used for a long time, it will still cause toxicity to the liver, kidneys, lungs, and other internal organs in a dose-dependent manner. In clinical applications, individual application of coral is often used as a substitute for orthopedic materials to treat diseases such as bone defects and bone hyperplasia. Second, coral is primarily available in the form of compound preparations, such as Ershiwuwei Shanhu pills and Shanhu Qishiwei pills, which are widely used in the treatment of neurological diseases such as migraine, primary headache, epilepsy, cerebral infarction, hypertension, and other cardiovascular and cerebrovascular diseases. It is undeniable that the effectiveness of coral research has exacerbated the endangered status of corals. Therefore, there should be no distinction between the advantages and disadvantages of listed endangered species, and it is imperative to completely prohibit their use and provide equal protection to help them recover to their normal numbers. This article can provide some reference for research on coral chemical composition, biological activity, chemical ecology, and the discovery of marine drug lead compounds. At the same time, it calls for people to protect endangered corals from the perspectives of prohibition, substitution, and synthesis

Directory of Open Access Journals

An Empirical Analysis of Range for 3D Object Detection

Author: Hays James
Li Mengtian
Peri Neehar
Ramanan Deva
Wang Yu-Xiong
Wilson Benjamin
Publication venue
Publication date: 08/08/2023
Field of study

LiDAR-based 3D detection plays a vital role in autonomous navigation. Surprisingly, although autonomous vehicles (AVs) must detect both near-field objects (for collision avoidance) and far-field objects (for longer-term planning), contemporary benchmarks focus only on near-field 3D detection. However, AVs must detect far-field objects for safe navigation. In this paper, we present an empirical analysis of far-field 3D detection using the long-range detection dataset Argoverse 2.0 to better understand the problem, and share the following insight: near-field LiDAR measurements are dense and optimally encoded by small voxels, while far-field measurements are sparse and are better encoded with large voxels. We exploit this observation to build a collection of range experts tuned for near-vs-far field detection, and propose simple techniques to efficiently ensemble models for long-range detection that improve efficiency by 33% and boost accuracy by 3.2% CDS.Comment: Accepted to ICCV 2023 Workshop - Robustness and Reliability of Autonomous Vehicles in the Open-Worl

arXiv.org e-Print Archive

Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

Author: Cao Shengcao
Gui Liang-Yan
Hays James
Li Mengtian
Ramanan Deva
Wang Yi-Xiong
Publication venue
Publication date: 17/08/2023
Field of study

Resource-constrained perception systems such as edge computing and vision-for-robotics require vision models to be both accurate and lightweight in computation and memory usage. While knowledge distillation is a proven strategy to enhance the performance of lightweight classification models, its application to structured outputs like object detection and instance segmentation remains a complicated task, due to the variability in outputs and complex internal network modules involved in the distillation process. In this paper, we propose a simple yet surprisingly effective sequential approach to knowledge distillation that progressively transfers the knowledge of a set of teacher detectors to a given lightweight student. To distill knowledge from a highly accurate but complex teacher model, we construct a sequence of teachers to help the student gradually adapt. Our progressive strategy can be easily combined with existing detection distillation mechanisms to consistently maximize student performance in various settings. To the best of our knowledge, we are the first to successfully distill knowledge from Transformer-based teacher detectors to convolution-based students, and unprecedentedly boost the performance of ResNet-50 based RetinaNet from 36.5% to 42.0% AP and Mask R-CNN from 38.2% to 42.5% AP on the MS COCO benchmark.Comment: ICML 202

arXiv.org e-Print Archive

Networked Time Series Prediction with Incomplete Data

Author: Gao Feng
Huang Jianqiang
Jiang Bo
Jin Haiming
Lin Tao
Wang Xinbing
Zhang Mengtian
Zhu Yichen
Publication venue
Publication date: 03/07/2023
Field of study

A networked time series (NETS) is a family of time series on a given graph, one for each node. It has a wide range of applications from intelligent transportation, environment monitoring to smart grid management. An important task in such applications is to predict the future values of a NETS based on its historical values and the underlying graph. Most existing methods require complete data for training. However, in real-world scenarios, it is not uncommon to have missing data due to sensor malfunction, incomplete sensing coverage, etc. In this paper, we study the problem of NETS prediction with incomplete data. We propose NETS-ImpGAN, a novel deep learning framework that can be trained on incomplete data with missing values in both history and future. Furthermore, we propose Graph Temporal Attention Networks, which incorporate the attention mechanism to capture both inter-time series and temporal correlations. We conduct extensive experiments on four real-world datasets under different missing patterns and missing rates. The experimental results show that NETS-ImpGAN outperforms existing methods, reducing the MAE by up to 25%

arXiv.org e-Print Archive